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Abstract 



We prove an analogue of the classical ballot theorem that holds for any mean zero 
random walk with positive but finite variance. Our result is best possible: we exhibit 
examples demonstrating that if any of our hypotheses are removed, our conclusions 
may no longer hold. 



1 Introduction 



The classical ballot theorem, proved by iBertrandl (118871 ). states that in an election where 
one candidate receives p votes and the other receives q < p votes, the probability that the 
winning candidate is in the lead throughout the counting of the ballots is precisely 

P-Q 

p + q' 

assuming no one order for counting the ballots is more likely than another. Viewed as a 
statement about random walks, Bertrand's ballot theorem states that given a symmetric 
simple random walk S and integers n, k with < k < n and with n and k of the same parity. 



P {Si> OW <i < n\Sn = k} 



k 
n 



The standard approach to extending Bertrand's ballot theorem is most easily explained by 
first transforming the statement, letting S[ = i — Si for z = 1, 2, . . . , n. S" is an increasing 
random walk, and the classical ballot theorem states that 



V {S[<i^ Q <i <n \ S'^ 



n 



n 



One may then ask: for what other increasing stochastic processes does the same result hold? 
This question has been well-studied; much of the seminal work on the subject was done by 
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Takacd ( 1962a b e. 1963 . 1964a b. 1967 ). The most general result to date is due to Kallenberg 
fll999h fsee also flKallenberd . l2003l . Chapter 11)). 



If, rather than transforming 5* into an increasing stochastic process, on takes the fact that 
Sn/ \/n converges in distribution to a normal random variable as a starting point, a different 
generalization of Bertrand's ballot theorem emerges. It turns out that if EX = and 
< E {X^} < oo, then for a simple random walk S with step size X, 

P {^i > V 1 < i < n\Sn = k} = Q for all k with < A; = O(v^). (1) 

(This is a slight misrepresentation; when we consider random variables that do not live on 
a lattice, the right conditioning will in fact be on an event such a.s {k < Sn < k + 1} or 
something similar. For the moment, we ignore this technicality, presuming for the remainder 
of the introduction that X is integer- valued and P {X = 1} > 0, say.) 



Furthermore, and (to the authors) more surprising, it turns out that this result is essentially 
best possible. We provide examples which demonstrate that if either E{X^} = cxd or 
k 7^ 0{^/n), no equation such as ([T]) can be expected to hold. The philosophy behind 
these examples can be explained yet another perspective on ballot-style results. One may 
ask: what are sufficient conditions on the structure of a multiset iS of n numbers summing to 
some value k to ensure that, in a uniformly random permutation of the set, all partial sums 
are positive wit h probabili t y of order k / nl This latter perspective is philosophically closely 
tied to work of lAndersenI (Il953l . Il954l ). ISpitzerl (119561 ) and others on the amount of time 
spent above zero by a conditioned random walk and on related questions. Our procedure for 
constructing the examples showing that our result is best possible is: 



1. construct a multiset }A (whose elements sum to fc, say) and for for which the bounds 
predicted by the ballot theorem fail; 

2. find a random variable X and associated random walk S for which, g iven that S^i — k^ 
the random multiset {Xi, . . . , X„} is likely close to in composition. 



The details involved in carrying out this procedure end up being somewhat involved. 



Outline 



In Section [2] we prove our positive result, the formalization of ([T]). Section [3] contains the 
examples which show that ([T]) may fail to hold if either E {X^} = cxd or if /c 7^ 0{\/n). Finally, 
in Section H] we briefiy discuss directions in which the current work might be extended and 
related potential avenues of research. 
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2 A general ballot theorem 



Throughout this section, X is a mean zero random variable with < E {X^} < oo and S is 
a simple random walk with step size X. Before stating our ballot theorem, we introduce a 
small amount of terminology. We say X is a lattice random variable with period c? > if there 
is a constant z such that dX — ^ is an integer random variable and d is the smallest positive 
real number for which this holds; in this case, we say that the set Lx = {{n + z)/d : n & 
is the lattice of X. A real number A is acceptable for X if A > l/ci (if X is a lattice random 
variable), or A > (otherwise). 

Theorem 1. For any A which is acceptable for X , there is C > depending only on X and 
A such that for all n, for all k > 0, 

P{k<Sn<k + A,Si>0\/0<t<n}< ^"^^^1^'^} ^ 

n ' 

and for all k with with < k < y/n, 

max{A;, 1} 



I'{k<Sn<k + A,Si>0\/0<i<n}> 



Before proving Theorem [H we collect a handful of results which we will use in the course of 
the proof. First, we will use a rather straightforward result on how "spread out" sums of 
independent identically distrib uted random variables become. The version we present is a 
simplification of Theorem 1 in iKesteru (119721 ): 



Theorem 2. For any family of independent identically distributed real random variables 
Xi,X2,... with positive, possibly infinite variance and associated partial sums 5*1, 5*2,..., 
there is c> depending only on the distribution of Xi such that for all n, 



sup 



P{x < ^„ < x + 1} < c/v^. 



We will also use the following lemma. Lemma 3.3 from ( jPemantle and Pered . Il995l ): 



Lemma 3. For h > 0, let be the first time t that St < —h. Then there are constants 
Ci, C2, C3 such that for all n: 

(i) for all h with < h < ^/n, P {T^ > n} > Ci ■ max{/i, l}/y/n; 
(a) for all h with < h < ^Jn, E {^S'^^T^ > n} < Cin; and 
(Hi) for all /i > 0, P {T^ > t^} < C3 ■ max{/i, \ \l ^Jn. 



(In ( Pemantle and Peres . 19951 ). (ii) was only proved with /i = 0, but an essentially identical 
proof yields the above formulation.) We will use the following easy corollary of Lemma [3] in 
proving the lower bound of Theorem [H 
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Corollary 4. There exists e > such that for all n and all h with < h < y/n, 

E {Sl I Th > n, Sn > ev^} < 3c2n. 

Proof. Choose e > such that for all n, P {S'„ > e^/n} > 1/3. By the FKG inequality, 
P {Sn > e^/^ I To > n} > 1/3. We thus have 

P {Sn > \To> n} 

< 3E {Sll^s^^,^ I To > n} < 3E {S^ | To > n} , 

and Lemma [3] (ii) completes the proof. □ 
Finally, in proving the lower bound of Theorem [Tl we will use a local central limit theorem. 



The following is a weakening of Theorem 1 from IStond (119651 ) . 

Theorem 5 ( Stond ( 19651 )). Fix any c > 0. // EX = and < E {X^} < oo then for any 



h > 0, if X is non-lattice then for all x with \x\ < Cy/n, 

, . -x2/(2nE{x2}) 

V{x<Sn<X + h} = (1 + 0(1))- 



v/27rE {X2}n 
and if X is lattice then for all x G Lx with \x\ < c-Jn, 

-2:2/(2nE{x2}) 

P{^„ = x} = (l + o(l))^__. 
In both cases, o(l) — > as n ^ oo uniformly over all x in the allowed range. 

Proof of Theorem [II We first remark that when X is a lattice random variable with period 
d^ii A = 1/d then [A;, k + A) contains precisely one element from the lattice of X. To shorten 
the formulas in the proof, we assume that X is indeed lattice, that A = 1/d., and that k is 
in the lattice of X, so that k < Sn < k + A ii and only if Sn = k. The proof of the more 
general formulation requires only a line-by-line rewriting of what follows. 

Throughout this proof, ci,C2, and C3 are the constants from Lemma [3] and e is the constant 
from Corollary |H We first prove the upper bound. We assume that n > 4. Let S**" be the 
random walk with 5*5 = and, for i with < z < ra, with S*"*"^ = SI — X„_j. Define To as in 
Lemma [3] and let T^ be the minimum of n and the first time t that SI < —k. 

In order that Sn = k and 5*4 > for all < z < n, it is necessary that 
. To> K4J, 
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^;+A> K4j,and 



By the independence of disjoint sections of the random walk and by Lemma [3] (iii), we have 

P {To > [n/A\ , n > ln/A\ } = P {To > [n/A\ } P > [n/A\ } 

C3max{A;, 1} 8c3max{A;, 1} 

We next rewrite the condition Sn = k as 

5'[3n/4] — Sln/A] = k — S'l„/4J — 5'[„/4j. 

Since S'|-3„/4] — Sin/i\ is independent of S^n/4\, of Sl^n/4\^ of the events {To > L^/4J} and 
{T[+A > L^/4J}, we have 

P {S„ = A; I To > [n/A\ , T[ > [n/A\ } < sup P - ^K4J = r} ■ 



By Theorem [21 we thus have 

P {S,. ^ . I n > [n/4J . n > [n/4J } < ^^^^^^l ^^^^^ < ^. (3) 

where c is the constant from Theorem O Combining ([2]) and (j3]) proves the upper bound. 

Fix a with 0<a< 1/2 so that 4(1 — 2a)E {X^} < e^, where e is the constant from Corollary 
m In order that Sn = k and Si > for all < i < n, it is sufficient that the following events 
occur: 



Ei: To > [an\ and e^yn < Sian\ < v^6c2n, 
E2: Tl > [an\ and e^/n < ^[^^j < y/Qc^, 

> —Sian\, and 

-E4: Sn = k. 

By the independence of disjoint sections of the random walk, we therefore have 

P {S„ = A;, > V < 2 < n} > P {El} ■ P {^2} ■ P {^3, ^4 I ^1, ^2 } • (4) 
By Lemma [3] (i), we have P {To > [anj} > ci/^/n, so by the FKG inequality we see that 

P {To > [anj, ^Lanj > e^/n] > ^p-=. (5) 



By applying Corollary H] and Markov's inequality, we also have 

6c2n ~ 2 



T3 / c ^ fa I ^ I I c ^ ^1 ^ E I To > \an\ , S^n > en} 1 

P |>5L"nJ > v6c2n I To > [an\,Sian\ > eV^j < < -, (6) 



and ([5]) and (j6]) together imply that 



P{E,}>-p^. (7) 



An identical argument shows that P {E2} > c\ ■ maxj/c, l}/(6A/n). By (jlj), to prove the 
lower bound it thus suffices to show that there is 7 > not depending on n or on our choice 
of k and such that P {Ej,, \ Ei, E2} > j/y/n; we now turn to establishing such a bound. 

Let m = \{1 — a)n \ — [an\ . For 1 < z < m, let 

Li = >S'[Q,„J_|_j — S^ctn]} S'Hd let Ri = S^(^i-a)n]-i ~ S^(^i-a)n]-i 

so in particular = —Rm- Next, rewrite the event -E4 as 

Lm = k — S\^ctn\ ~ ^[an] ■ 

By the independence of disjoint sections of the random walk, we then have 

P{E3,E4 I ^1,^2} > inf \p{Lm = -p+{k + q), mm L,> -p}\ .{8) 

p,qe[e^,y^6E^nLx { l<i<m J 

For p and g as in ([8]), we define the following shorthand: 

• Ap^q is the event that Lm = —p + {k + q), and 

• Bp is the event that mini<j<m Li > —p. 

We bound P {Ap g, Bp} by ffist writing 

P {Ap,g, Bp} > P {ApJ - P {Ap^g, B;} . (9) 



By Theorem [5l for all k with < A; < ^/n and all g G [e^/n, v^6c2n] fl Lx, 

-ik+q-pf/{2a^m) 

P {Ap,g} = (1 + 0(1)) , (10) 

where o(l) as n ^ 00, uniformly over all k, p, and q as above. To bound P {Ap^g, Bp^ 
from above, we ffist further divide the events Ap^g, Bp. Let m' = \n/2\ — [_an\. Observe 
that Ap^g occurs if and only if R^ = —{k + q)+p. Similarly, if Ap^g occurs, then for Bp to 
occur one of the following events must occur: either 

1. mini<j<m' Li < —p (we call this event Cp); or 
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2. mini<j<m_m/ Ri < —{k + q) (we call this event Dg). 



Thus, 

P {Ap,g,B;} < P {Cp, L^ = ~p+ik + q)} + P {D„ Rm = -{k + q) + p} . (11) 
By Kolmogorov's inequality and our choice of a, 

P{C}< = < E{Xn((l-2a)n + l) ^ 1^ ^^2) 



for all n sufficiently large. Furthermore, since for any i with 1 <i < m', Lm — Li is a sum of 
m — i>m — m'> [(1 — a)n] — [n/2j copies of X, by the independence of disjoint sections 
of the random walk and by Theorem [5l 

P{L^^ = -p+(k + q) \U<-p} = (l + o(l))— == 

y 27rcr"^(m — t) 

< (1 + 0(1)) , (13) 

where o(l) ^ as n — > cx), uniformly over all i, k,p, and g as above. It follows by Bayes' 
formula that 

P {Cp, = -p + {k + q)} < P{Cp}- m8.xP{L^ = -p+{k + q) \ Li<-p} 

l<i<m 
^-{k+q-pf/{2cr^m) 

4v^27ra%i 



A similar calculation shows that 

-(fe+g-p)V(2tT2m) 
72 



P {D„ R^ = -{k + q)+p}<(l + 0(1))^—-=^^, (15) 



"m 

and combining ([9]), (fTOl) . (fT4|) and (fTSjl . since m and n have the same order we see that 

P 5p} > (1 + 0(1))——=^^ > ^, (16) 

2v27rcr^m 

for all /c, p and q in the allowed ranges, for some 7 not depending on /c, p, g, or n. Together 
(IE]) and ( fT6|) establish the required lower bound on P{i?3,i?4 | Ei,E2} and complete the 
proof. □ 



3 Counterexamples 

In Section 13.21 we exhibit a random walk S with mean zero step, finite variance step size 
X and show that with k = n, P {Sn > V < i < n\Sn = k} is not Q{k/n) = 0(1) as the 
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ballot theorem would suggest. This example may be easily modified to show that we can 
not in general expect a result of the form P {Sn > < i < n\Sn = k} = Q{k/n) for any 
k = Q{^/n\ogn), and we believe that with a little effort it should be possible to make this 
approach work for any k = uJ{^/n). 

In Section [373] we exhibit a random walk S with step size X for which X is an integer random 
variable with period 1, and for which EX = 0, E jX^/^"*^} < 00 for all e > 0, but such that 
P{Sn > \/ < i < n \ Sn = ^/n} is not 9(n~^/^); the same idea can be easily modified 
to yield a random variable X with EX" < 00 for any fixed a < 2 and for which the ballot 
theorem can be seen to fail even in the range Sn = 0{y/n). 

The ideas behind our examples are most easily explained from the multiset mentioned in 
the introduction. The "underlying multiset" S for our example showing that the condition 
k = 0{n) is necessary consists of {n — l)/2 elements of value 1, the same number of elements 
of value — I's, and a single element of value n. The elements of S sum to n, and in order 
for all partial sums to stay positive, it is necessary and sufficient that the partial sums 
not containing the element n stay positive (as all partial sums containing n are certainly 
positive). Denoting the elements of iS by xi, . . . , x„ and letting a be a uniformly random 
permutation of {l,...,n}, the index i for which Xrj{i) = n is uniform among {l,...,n}. 
Letting Sa{i) = X]j=i ^o-(i) for < i < n, we may thus write 



1 " 

P >OVO<i<n} = --^P {Saii) > V < i < = n] 

^ i=i 
1 " 

= - ■ ^P > V < 2 < j>^(,) = n] . (17) 



n . 



For a symmetric simple random walk 5", it is well-known (see, e.g., iFellerl ( 1l968al ). Lemma 
IIL3.1) that for integers j > 0, P {5*; > V < z < j} = 0(j-^/2) Making the hopefully 
plausible leap of faith that the same bound holds for P {Sa{i) > V < i < j|xo-(j)=n}, (ITTI) 
then yields that 

p{s,,.,>ovo<.<„} = i.|:o(^)=o(-L). 

not 6(1) as the ballot theorem would suggest. Turning this intuition into the example of 
Section 13.21 is a matter of finding a random walk S with mean zero step size X G for 
which, given that Sn = n, the set {Xi, . . . ,X„} very likely looks like the set S above, i.e., 
there is a single index i for which Xj = n, and for all other j, Xj is "small". 

For the example showing that the condition E{X^} < 00, "underlying multiset" we are 
thinking of consists of roughly (n — n^/^)/2 elements of value 1, the same number of elements 
of value +1, (n^/'^ + l)/2 elements of value y/n, and (n-^/^ — l)/2 elements of value —y/n. 
These elements sum to y/n. 
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For all partial sums in a uniformly random permutation of this multiset to stay positive, it 
is necessary that the partial sums stay positive until an element of value -y/n is sampled - 
this should occur after about n^/^ elements have been sampled, so the intuition given by a 
symmetric simple random walk suggests that the partial sums stay positive until this time 
with probability of order ©(n-^^/^^'^^Z^)) = 0{n-^/^). 

In order that the partial sums stay positive, it is also essentially necessary that the "sub- 
random walk" consisting of the partial sums of only elements of absolute value \/n stays 
positive - for if this "sub-random walk" becomes extremely negative then it is very un- 
likely that the original partial sums stay positive. Dividing through by n^/^ we can view 
this "sub-random walk" as a symmetric simple random walk S' of length n^/^, conditioned 
on having 5'[^i/4 = 1- By the ballot theorem, the probability such a random walk stays 
positive is 0(l/n^/'^). Combining the bounds of the this paragraph and the previous para- 
graph as though the two events were independent (which, though clearly false, gives the 
correct intuition) suggests that the original partial sums should stay positive with probabil- 
ity 0(l/n^/^+^/'^) = 0(l/n^/*), not 6(n"-^/^) as the ballot theorem would suggest. 

Before we turn to the details of these examples, we first spend a moment gathering two easy 
lemmas that we will use in the course of their explanation. 



3.1 Two Easy Lemmas 



The first lemma bounds the probab i lity tha t a random walk stays above zero until some time 
m, and is a simplification of iFellen (jl968bl ). Theorem XII. 7. 12a. 



Lemma 6. Given a random walk S with step size X, if X is symmetric then for integers 
m > 0, 

P {Si> OW <i <m} " 







The second lemma is an easy extension of classical IChernofj (119521 ) bounds to a setting in 
which the number of t erms in the b i nomia l is random. The classical Chernoff bounds (see, 
e.g., (2.5) and (2.6) in Janson et al. ( 20001 ) for a modern reference), state: given a binomial 
random variable Bin(n,p) with mean fj, = np, for all c > 0, 



P {Bm{n,p) > (1 + c)/i} < e-^'^^/^'^^+^Z^^)) 
P {Bm{n,p) < (1 - c)/i} < e^"''^/^ 



(18) 
(19) 



The following lemma follows from the Chernoff bounds by straightforward applications of 
Bayes' formula: 

Lemma 7. Let m be a positive integer, let < q < 1 , and let U be distributed as Bin(m, q). 
Let V be a positive real number and let Vi^ V2, . . . be i.i.d. random variables taking values ±.v, 
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each with probability 1/2. Finally, let Y = Vi + . . . + Vu. The for all t > 0, 



P {y > t} < exp 



t^ 



+ exp |-^}, and 



P{Y<-t} < exp{-^}+exp{-!^}. 



Proof. As E[/ = mq, by f|T8l) we have 

[2mgJ 

P{Y>t} < ^ P{r >t|f/ = M}P{f/ = M} + P{f/>2mg} 

< sup P {Y > t\U = u} + P {U > 2mq} 

u<2mq 

< sup P{F > t|f/ = n} + 6""^ (20) 

u<2mq 

Given that U = u, {Y + uv)/2v, which we denote Y', is distributed as Bin(u, 1/2). Further- 
more, in this case Y > t if and only if 

u t u f_, t 
Y'>- + — = -{l + — 
2 2v 2\ uv 



u f t V f 2t ^ ^ 



It follows by ([T8]) that 



P{Y>t\U = u} < exp{ — — 2 + 



= exp 
< exp 



2 \uv J \ 3uv 



t^ 



2 



8mg + ' ' 



where in the last inequality we use that u < 2mq and f > 1. As m was arbitrary, combining 
this inequality with (120|) yields the desired bound on P {F > t}. The bound on P {F < —t} 
is proved identically. □ 



3.2 Optimality for normal random walks 

Let / be the tower function: /(O) = 1 and f{k + 1) = 2^^'^"' for integers k > 0. We define a 
random variable X as follows: 




±1, each with probability | ■ (^1 — Xli^o J^J 
±f{k), each with probability jfTW^ for /c = 1, 2, . . . 
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and let S he a random walk with steps distributed as X. Clearly EX = 0, and it is 
easily checked that E{X^} < 2. We will show that when n = f{k) for positive integers 
k, P {Si > VO < i < n\Sn = n} is 0{l/^/n), so in particular, for such values of n this 
probability is not 0(1) as the ballot theorem would suggest. 

For z > we let Ni be the number of times t with 1 < t < n that \Xt\ = f{i)', clearly 
Yl'jZo ~ ^- ■^'-'^ Sn = n to occur, it suffices that for some t with 1 < t < n, Xf = n and 
Sn — Xf = 0] therefore 

P{Sr. = n}>j2P{Xt = n,S..-X, = 0}>j2 ^^^--J' = ^\ 

t=i t=i 

As EX^ < oo and, for all 1 < t < n, S'^ — Xt is simply a sum of n — t independent copies of 
X, by Theorem [5] we know that P — = 0} = B(n~^/^) uniformly over 1 < i < n. It 
follows that 

1 



P{Sn = n} = n^-^j. (21) 

We next bound the probability that Sn = n and St > for all < t < n; we denote this 
conjunction of events E. Our aim is to show that P {E} = 0(n~^), which together with 
(I2T1) will establish our claim that P {Si > V < i < n\Sn = n} is 0{l/^yn). Recalling that 
f{k) = n, we write 

P{E} = P{E,Nk = 0} + P{E,Nk>l}. (22) 

It is easy to show using Chernoff bounds that P {E,Nk = 0} = 0{n~^) (we postpone this 
step for the moment). From this fact and from (!22l) . we therefore have 

P{E} = P{E,Nk>l} + 0{n~'^), 

from which it will follow that P {E} = 0{n''^) if we can show that P {E, > 1} = 0(n^^). 
We first do so, then justify our assertion that P {E, = 0} = 0{n^^). 

For E and {Nk > 1} to occur, one of the following events must occur. 



• For some t with 1 < t < [n/2\ , Xt = ±n, Si > for all < i < t, and Sn = n. We 
denote these events Bt, for 1 < t < \n/2\, and remark that they are not disjoint. 

• 5*4 > for all < z < [?^/2j , and for some t with \n/2\ < t < n, Xt = ±n and Sn = n. 
We denote these events Dt for [n/2\ < t < n; again, they are not disjoint. 



We first bound the probabilities of the events Bt, 1 <t < \n/2\. Fix some t in this range 
- by Lemma El the probability that Si > for all < i < t is 0(t^^/^) uniformly in t. By 
definition, P {Xt = ±n} = n~^, so by the strong Markov property. 



P{S^>OWO<i<t,Xt = ±n} = 
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still uniformly in t. Furthermore, by Theorem [2] we have that 



supP{5„ -St = r} = 0{ -yL=] = o(^]. (23) 



y/n-t J \^/n 

As Sn — St and n — St are independent, it follows from ( l23l) that P {Sn — St = n — St} 
0(n~^/^), and by another application of the strong Markov property we have that 

P {Bt} = P{S,>OyO<t<t,Xt = ±n, S.^-St = n-St} = ^ 



Thus, 

{L"/2J 1 ln/2\ L«/2J / T \ / 1 \ 

y.,} . gP™,go(-^)^o(i). (.4) 

We next bound the probabilities of the events Dt, [n/2\ < t < n. By an argument just as 
above, we have that 

P {5, > VO < z < [n/2\,Xt = ±n} = ^ 



^9/2 ) ■ 

Also just as above, since Sn — Sin2\ ~ and n — S'^^jj — Xt are independent, 

P [Sn — 5'l„2j — Xt = n — S'l„2j — Xt] = O ■ 

By the independence of disjoint sections of the random walk we therefore have that P {Dt} = 
0{n~^), and so 



t=[n/2\+l 

AsEn{Nk > 1} is contained in Uj!!f ^ Bt U Ur=Ln/2j+i ^t, it follows from ^ and (ESI) that 
P {E,Nk>l} = 0{n-^) as claimed. 

We now turn our attention to proving that P {E, = 0} = 0{n^^). We will in fact show 
that 



P = 0, > 8k^/n\ogn^ = O (^-^^ 



(26) 



which implies the desired bound. We recall that Ni is the number of times t with 1 <t < n 
that Xt = f{i). For < i < A;, let Sn,i = J2{i<j<n:\Xj\=f(i)} ^j- If A^fc = then either 
Sn = YliZi Sn,i or Nj > for some j > k; thus. 



fc-i 



{Nk = 0, > 8k^n\ogn} C <j ^ A^,- > ^ U [j{\Sn,i\ > S^nlogn}. (27) 



12 



For any 1 < i < k, S'„ j is the sum of Ni i.i.d. random variables taking values ^g{i), each 
with probabihty 1/2, and is distributed as Bin(n, (yf(i)~^). By LemmaEJ therefore, 

P{|S„,|>8A1^} < 2expj-^-:^^pi^j+2exp|^}. 

Since g{i) < logn, presuming n is large enough that 32^/n\ogng{i) < n we thus have 

f{\S.A>sV^.} < 2exp{^}+2e.p{^}^0(i). (28) 

Furthermore, it follows directly from the definition of X that P |X]jl/c+i > o| = o(2~"). 
Applying this fact, (l28l) . and fl27|) . it follows immediately that 

P {iVfc = 0, 1^,1 > 8k^/n[^^ = o(2"") + O (^-^^ = O 

as claimed. 




3.3 Optimality for other random walks 

Let : N — N be an rapidly increasing integer-valued function with g{0) = 1; in particular, 
we choose g such that g > f where / is the tower function seen in the previous section. 

^ _ I ±1, each with probability | (l - E^o -^(^) 

1 ±g{k), each with probability 2g{kyi/^ ' for /c = 1, 2, . . . 

and let 5* be a random walk with steps distributed as X. Clearly EX = 0. For integers 
i > 0, let Posi be the event that Sj > for all < j < i; we also let Posq be some event 
of probability 1 as it will simplify later equations. We will show that when n = g{kY for 
positive integers A;, 

(29) 



and that 

/ log-13/2 „ 

P {Sn = v^,Pos4 = O ^ I , (30) 



n 



from which it follows by Bayes' formula that for such values of n P{Pos„|S'„ = \/n} is 
0{\o^ n/'TV'/^), not Q{l/y/n) as the ballot theorem would suggest. We now prove (|29l) and 
(1301) . In what follows we presume, to avoid cumbersome floors and ceilings, that g{k) = ^Jn 
has been chosen so that A/log n and n^/^ are both integers. 
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For j = 1,2, . . . and z = 0, 1, . . ., we let Xj^i be the random set {X^ '■ 1 < m < j, \Xm\ = dii)}, 
let Nj^i = and let Sj^i = ^Xm&x -^fn- Fo^^ J = 1)2,..., the sets Xjo, Xj^i, . . . 

partition {Xi, . . . , X^}. For an integer A; > 0, the k-truncated random walk S^'^^ is given by 

1=1 i=l 

for j = 1,2,.... We remark that for any n and any set X C {Xi, . . . , X„}, conditional upon 
the event that IJi=i '^n.j = is simply a sum of \X\ i.i.d. bounded random variables 

with variance at most g{kY- In particular, this implies that after such conditioning, Sn"^ 
obeys a local central limit theorem around 0. The key consequence of this fact (for our 
purposes), is that we may choose g to grow fast enough that there exists a small constant 
e > such that: 

V A; > 0, M n >n>g{k + l), V A" C {Xi, . . . , X„/} s.t. \X\ = n, 
P\si^^ = {Xr.l<^<n',\X,\<g{k)} = x}>—^■, (31) 

such a constant is guaranteed to exist by Theorem [5] and our above observation about the 

(k) 

conditional distribution oi Sn ■ 

Fix some integer A; > 1 and let n = g{kY . We remark that EX^^fc = nP {X = g{k)} = n^^^ /2. 
For Sn = \fn to occur, it suffices that Sn^h = g{^^ = and that Sn — Sn,k = 0. For any 
subset /C of 5 = {Xi, . . . ,X„}, Sn,k and S'„ — Sn,k are conditionally independent given that 
= fC. Letting Z be the set of subsets of S of odd parity and of size at most we 
then have 

Y'{Sn = \/n] > P{Sn,k = g{k),Sn- Sn,k = 0} 

> P {Sn,k = g{k), Sn - Sn,k = 0, < 2n'/\ Nn, odd} 

> f inf P {Sn.k = g{k), Sn - Sn,k = 0\Xk = /C} V P {X„„, < 2n'/\ Nn,i odd} 

\x,iV{Sn.k = g{k)\X^ = 10^V{Sn-Sn,k = ^\Xk = 10^ ■ Q-o(l)) . 

(32) 

by the aforementioned independence and a Chernoff bound. 

To bound this last formula from below, fix an arbitrary element /C of Z. Note that Sn — Sn,k = 
Sit~'^^ unless there is i > k such that Xi ^ 0. Since P{|X| > g{k)} = 0{g{k + 1)^/2) = 
0(2-3«/4)^ it is easily seen that P {[jZk^^ = ^\^k = /C} = 1 - o(2-"/2)_ Thus, by Bayes' 
formula. 



P {Sn — Sn,k — 0\Xk — /C} — P < — Sn,k — 



UA'fc = A'fc = /C l(l-o(2-"/2)) 



i=k 



p I si!'-^^ = 



\JX, = Xk = )c\{l-o{2~-/^)). (33) 
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Letting X = {Xi, . . . , X„} — /C, the previous equation implies that 

P {Sn - 5n,fc = 0\Xk = IC} = n{P {S^^-'^ = I {X, : 1 < ^ < n, |X,| < g{k -1)} = A:}). 

(34) 

As |/C| < 2n^/^ \X\ =n-\}C\ >n-2n^/^ > n^/^ = g{k), so applying dM]) in ([31]) yields that 



\g{k-l)y/n-\}C\J \g{k-l)y/n) 

Next, for any set K, & given that Xk = ^i it follows directly from a binomial approximation 
that P{^„,fc = g{k)\\Xk\ = /C} = ^](|/C|-l/2) = Q{n-^/^). Plugging this bound and ((351) into 



(1321) yields that 
establishing ( l29i) . 

We next turn to our upper bound on P {^^ = y/n, Pos„}. We shall define several ways in 
which the walk 5* can "behave unexpectedly". We first show that the walk is unlikely to 
behave unexpectedly; it will be fairly easy to show that given that none of the unexpected 
events occur, the probability that {Sn = v^} ^"^^ PoSn both occur is 0(log^ n/ra^/^). Com- 
bining this bound with our bounds on the probability of unexpected events will yield f l30p . 

We first describe and bound the probabilities of the so-called "unexpected events". Let B 
be the event that there is i with 1 <i <n for which \Xi\ > g{k). By a union bound, 

P{B} < nP{|Xi|><7(A;)}=n( P{Xi = ^(0}) =n( — 

\i=k+l / \i=k+l ' 



= "O^^j-o^^j (37) 

Next, let T be the first time for which Xt = g{k) = ^pn. Letting t* = bn^^'^logn, we have 

P {T > r} < P |n{|X,| ^ v^}| = (l - j = O j (38) 

Now, by another union bound, 

P \St-i > 8ky/t*logn,T < t*,B] < t* sup P \\St-i\ > 8ky/t* logn,T = t, b] (39) 



If {T = t} and B occur, then St-i = Yl'i=i St~i,i- Thus, by an argument just as we used to 
prove (l26l) . we can see that 



p{\St-i\ > 8k^t* log n,T = = O 



(40) 



15 



which, combined with (l39l) . yields that 



P {\St-i\ > 8ky^FJ^,T < t*,B] = O ( J) = O . (41) 

Finally, for < t < n, let be the set of subsets of {t + 1, . . . , n} of size between and 
3n^^^/2. Let Z = Zq, and let {Ti, . . . ,Tr}, which we denote T, be the set of indices i with 
T < i < n foT which = y/n, ordered so that T < Ti < . . . < Tr < n. i? is distributed as 
Bin(n — T, n" -3/4), so by Bayes' formula, (fT8|), and (flQl), 

P{r^Z,T<f} = ^P{T^Z|T = t}P{T = t} 

< supP{r ^ z|T = t} = supP{r ^ Zt|T = t} 

i<t* t<t* 

^ / 1 \ / 1 \ 3ni/4 

= sup P < Bm n - t, < — — or Bm [n -t, -—j- > 



-0[^). (42) 

This completes our bounds on the "unexpected events". We next use these inequalities to 
bound P {Sn = \/n, Pos„}. Roughly speaking, in order that {Sn = -\/n} cind Pos„ occur, it 
is necessary that 



(a) S stays positive until time T, 

(b) The random walk S' given by SI = J2]=i -^t^ / Vn does not go "too negative" and 
additionally 15*^1 is not "too large", and 

(c) Sn - St - VnS'^ = y/n- St - VnS'j^. 

Though the event in (c) is precisely the event that Sn = y/n, we write it in this form in order 
to point out that once we have conditioned on fixed values for T and T, Sn — St — VnS'^ is 
independent of ^/n — St — \^S'j^. We now turn to the details of defining and bounding the 
events in (a)-(c). 

First, recall that t* = bn^^^logn. For any t < t*, by Lemma El 

P{PosT,T = t} < P{Post_i,T = t} 

< P{Post-i,\Xt\ = V^} 

= P{Post-i}P{\Xt\ = V^} 

- of^Vofil^of^) (43) 
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For any events E, F, and G, P {E} < P {E,F} + P [F], and P {E, F} < P {E, F, G} + 
P {F, G}. We now apply these bounds together with the bounds (|57l) . (1551) . and (HTi) . to see 
that for any event E, 

P{E,Post} < P {E, Post, B} + 0(2-''/'^) 

< P {E,Post,B,T <t*} + 0{n-^) 

< P ^^E,Post,B,T < t*, \St-i\ < 8A;v/t*logn| 

+p|5,T<r,|ST-i| > 8ky/t* \ogn^+0{n-^) 
= P ^^E,Post,B,T < t*, \St-i\ < 8k^/t*\ogn^ +0{n-^) (44) 

Continuing in this fashion using ( l42l) . ( 14^ . and the fact that 8ky/t* logn < 20n^/^log^^^ n, 
and letting j* = 20n^/* log^''^ n, we have 

P{F,Post} < P {E,Post,B,T <t*,\ST-i\ < j*,T e Z} 
+P{T ^ Z,T < t*} + 0{n-^) 

= P{E,Post,B,T <t\\ST-i\ <]\TeZ} + 0{n-^). 

t* 

= Y,P{E,PosT,B,T = t\\ST-i\<j\reZ} + 0{n~'>). (45) 
t=i 

By applying Bayes' formula and (H3l) . this yields 
P{F,Post} < 5^P{PoSi,T = t}P{F|PoSi,T = t,S,|^,_i| <j*,TGZ} + 0(n-^) 

= ^ (tI^) P ^ = I'^t-il < J*, TeZt} + 0(n-5)(46) 

Next, for any 1 < t < t* we have 

P {E\PosuT = t,B,\St-i\ < 3\r e Zt} < sup P {F|Post, T = t, 5, = s, T = J} , 

\s\<j'',1(^Zt 

which together with fH6l) gives 

P{F,Post} < J^Or^^) sup P{E\PosuT = t,B,St-i = s,T = l} + 0{n-'') 



O(^) sup sup P{£;|PoSi,T = t,5,5i_i = s,r = J} 

+0(n-^) 

^(^^'^l sup sup P{E\PosuT = t,B,S,^^ = s,T = Z}^0{n 

n^i^ ) i<t<t' \s\<j*,ieZt 



-5\ 



(47) 
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We will apply equation (H7|) with E the event {S'„ = n} fl Pos„. We first note that for a 
given t with 1 < t < t*, if \Xt\ = y/n and \St-i\ = s < j* < y/n, then for Post to occur 
necessarily Xf = y/n, so = y/n + s. 

Fix any integer t with 1 < t < t*, any integer s for which |s| < j*, and any I ^ Zt. We 
hereafter denote by Good the intersection of events 

Post n {T = t} n 5 n = s} n {T = J}, 

and by P'^{-} the conditional probability measure 

P{-\Good}. 

Given that {Ti, . . . , Tr} = X, i? is deterministic - say i? = r - and n^/^/S < r < 372^/^/2. 
We recall that 5" was the random walk with S'^ = J2]=i -^Tj / \pn. As previously discussed, 
given that Good occurs, St = a/^+ s. In order that {Sn = \/n} and Pos„ occur, then, it is 
necessary that either 

1. for some integer m with |m| < lOlog^n, S'^ = m, Sj > — lOlog^n for all 1 < j < r, 
and Sn — St — y/nSl. = —s — niy/n (we denote these events Bm for \m\ < lOlog^n), or 

2. there is j with t < j < n for which | "^iZi > lOy^log^n (we denote this event 
C). 

To see this, observe that if none of the events Bm occurs and C does not occur, then either: 

• IS*' I > lOlog^ra, in which case 

fe-i 

Sn > S'^y/n - I ^ Sn,i\ > S'^. - 10^71 log^ Tl > y/u, 

2=1 

SO Sn ^ \fn, or 

• there is j with 1 < j < r for which S'^ < — 10 log^ n, in which case 

fc-i 

St, < S'jy/^ + I J2 St.A < S'jV^ + lOV^log^ n<0, 
1=1 

so PoSn does not occur, or 

• S*', = m for some m with |m| < lOlog^n, but Sn — St ~ y/n + s ^ —s ~ niy/n, so 
Sn ^ Vn. 
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Thus, by a union bound, 

{/ \ 1 lOlog^n 

\|m|<101og2n / J m=-101og2n 

To bound the probabihties we first note that, denoting by the event that 

S'j. = m and Sj > —10 log^ n for all 1 < j < r, 

Bm = A^n {Sn - St - y/nS', = y/u - s - m^/u} . 

Furthermore, Am and {Sn — St — \/nSl = -Jn — s — ra-Jn) are independent as they are 
determined by disjoint sections of the random walk, so for all m with |m| < lOlog^n, 

V'{Bm} = {Am, Sn - St - V^S', = y/^ - s - m^/E} 

= P'{Am}P'{Sn-St-V^Sl = V^-s-m^} (49) 

Now, given that Good occurs, 5" is nothing but a symmetric simple random walk of length 
r; thus, by Bertrand's ballot theorem, 

P= {A„,} = O ^('"+101og'"+;K101of » + _ o ^1^^ ,50) 

Also, given that Good occurs, Sn — St — \/nS'^ is a sum of n — t — r = fl{n) i.i.d. integer-valued 
random variables that are never zero; Thus, by Theorem [2l 



P"" {Sn — St — VnS'j. = \pn — s — niy/n] = O ^"^^ 



(51) 



and combining ( 1491) . ( jSOll . and (1511) yields that 



P'{Brn} = 0C^]. (52) 



^7/8 

For any j with t < j < n, an argument just as that leading to (|26|) shows that 
so 

{C} < E 1 1 E ^i'^l ^ lOv^log^ ^1=0 , 



(53) 



and (HE]), dSD, and ([22]) together yield 

10 log^ n 



log^ 



n 



7/8 



m=— 10 log n 



(54) 
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Since t,s, and I E Zf were arbitrary, (H7|) and (15^ combine to give 

-\/logn\ /log^n\ / log^^^^ n 



5/4 



Finally, since if PoSn occurs then either Post occurs or T > n, by ( l38i) we have 
P {5„ = v^, Pos„} > P {S„ = v^, Pos„, Post} - P {T > n} 



as asserted in (j30l) . 



^5/4 / I ^5 y 1 ^5/4 



4 Conclusion 



The results of this paper raise several questions. While our examples show that Theorem [T] 
is essentially best possible, is it perhaps possible that a ballot theorem holds for real-valued 
Markov chains with finite variance? Also, one observation about the example of Section [X^ 
is that in that example, the step size X is not in the domain of attraction of any distribution. 
This leaves open the possibility that a ballot-style theorem may hold if the X has mean zero 
and is in the domain of attraction of some stable law. Such behavior seems unlikely but is 
not ruled out by our examples. 

Finally, it would be very interesting to derive conditions on more general multisets iS of n 
numbers summing to some value k which guarantee that, in a uniformly random permutation 
of iS, all partial sums are positive with probability of order k/n. Indeed, perhaps such 
work could end up not only generalizing the work of this paper, but perhaps unifying it 
with the existing discrete-time ballot theorems based on the "increasing stochastic process" 
perspective. 
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